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Abstract 

A quantum compiler is a software program for decomposing ("compiling") an arbi- 
trary unitary matrix into a sequence of elementary operations (SEO). The author of 
this paper is also the author of a quantum compiler called Qubiter. Qubiter uses a 
matrix decomposition called the Cosine-Sine Decomposition (CSD) that is well known 
in the field of Computational Linear Algebra. One way of measuring the efficiency 
of a quantum compiler is to measure the number of CNOTs it uses to express an 
unstructured unitary matrix (a unitary matrix with no special symmetries). We will 
henceforth refer to this number as e. In this paper, we show how to improve e for 
Qubiter so that it matches the current world record for e, which is held by another 
quantum compiling algorithm based on CSD. 
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1 Introduction 



In quantum computing, elementary operations are operations that act on only a few 
(usually one or two) qubits. For example, CNOTs and one-qubit rotations are ele- 
mentary operations. A quantum compiling algorithm is an algorithm for decomposing 
("compiling") an arbitrary unitary matrix into a sequence of elementary operations 
(SEO). A quantum compiler is a software program that implements a quantum com- 
piling algorithm. 

Henceforth, we will refer to Ref. PQ as Tuc99. Tuc99 gives a quantum compiling 
algorithm, implemented in a software program called Qubiter. The Tuc99 algorithm 
uses a matrix decomposition called the Cosine-Sine Decomposition (CSD) that is 
well known in the field of Computational Linear Algebra. Tuc99 uses CSD in a 
recursive manner. It decomposes any unitary matrix into a sequence of diagonal 
unitary matrices and something called uniformly controlled U(2) gates. Tuc99 then 
expresses these diagonal unitary matrices and uniformly controlled U(2) gates as 
SEOs of short length. 

More recently, two other groups have proposed quantum compiling algorithms 
based on CSD. One group, based at the Univ. of Michigan and NIST, has published 
Ref. [2], henceforth referred to as Mich04. Another group based at Helsinki Univ. of 
Tech. (HUT), has published Refs.fH]. and jl], henceforth referred to as HUT04a and 
HUT04b, respectively. 

One way of measuring the efficiency of a quantum compiler is to measure the 
number of CNOTs it uses to express an unstructured unitary matrix (a unitary matrix 
with no special symmetries). We will henceforth refer to this number as e. Although 
good quantum compilers will also require optimizations that deal with structured 
matrices, unstructured matrices are certainly an important case worthy of attention. 
Minimizing the number of CNOTs is a reasonable goal, since a CNOT operation (or 
any 2-qubit interaction used as a CNOT surrogate) is expected to take more time 
to perform and to introduce more environmental noise into the quantum computer 
than a one-qubit rotation. Ref. |5| proved that for unitary matrices of dimension 2 Nb 
(Nb = number of bits), e > 1(4^ — 3N B — 1). This lower bound is achieved for 
Nb = 2 by the 3 CNOT circuits first proposed in Ref. [U]. It is not known whether 
this bound can always be achieved for Nb > 3. 

The Mich04 and HUT04b algorithms try to minimize e. In this paper, we 
propose a modification of the Tuc99 algorithm which will henceforth be referred to 
as Tuc04. Tuc04 comes in two flavors, Tuc04(NR) without relaxation process, and 
Tuc04(R) with relaxation process. As the next table shows, the most efficient algo- 
rithm known at present is Mich04. HUT04b performs worse than Mich04. Tuc04(R) 
and Mich04 are equally efficient. 
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Caveat: Strictly speaking, the efficiency of Tuc04(R) as listed in this table is 
only a conjecture. The problem is that Tuc04(R) uses a relaxation process. This 
paper argues, based on intuition, that the relaxation process converges, but it does 
not prove this rigorously. A rigorous proof of the efficiency of Tuc04(R) will require 
theoretical and numerical proof that its relaxation process converges as expected. 



2 Notation 

This paper is based heavily on Tuc99 and assumes that the reader is familiar with 
the main ideas of Tuc99. Furthermore, this paper uses the notational conventions of 
Tuc99. So if the reader can't follow the notation of this paper, he/she is advised to 
consult Tuc99. The section on notation in Ref. [Zj is also recommended. 

Contrary to Tuc99, in this paper we will normalize Hadamard matrices so that 
their square equals one. 

As in Tuc99, for a single qubit with number operator n, we define Po = n 
and P\ — n — 1 — n. If R = (ki, k 2 , . . . , hn k ) labels Nk distinct qubits and b = 
(&i, 6a, ... , b Nli ) E BooI Nk , then we define P$t) = P bl (Ki)P h2 (K 2 ) . . . Pb Nlc {^N K )- 

When we say A (ditto, A') is B (ditto, B'), we mean A is B and A' is B' . 

For any complex number z, we will write z = \z\e l ^( z \ Thus, \z\ and L{z) are 
the magnitude and phase angle of z, respectively. 

G x ,Gy, e z will denote the unit vectors along the X, Y, Z axes, respectively. For 
any 3d real unit vector s, a s = a ■ s, where a = (a x ,cry,a z ) is the vector of Pauli 
matrices. 



3 [/(iV)-subsets and ?7(iV)-Multiplexors 

We define a ?7(iV)-subset to be an ordered set {Ub}yb of iV dimensional unitary 
matrices. Let the index b take values in a set with N b elements. In this paper, we 
are mostly concerned with the case that S& = BooI Nk , and b is represented by b. 

Suppose a qubit array with N B qubits is partitioned into N T target qubits and 
N K control qubits. Thus, N T , N K are positive integers such that N B = N T + N K . Let 
k = (aci, k 2 , ■ ■ ■ , Knk) denote the control qubits and r = (t 1; t 2 , . . . , t Nt ) the target 
qubits. Thus, if f and k are considered as sets, they are disjoint and their union is 
{0, 1, . . . , Nb — 1}- Let {Ui\^ Boo1 n k be an ordered set of operators all of which act 
on the Hilbert space of the target qubits. We will refer to any operator X of the 
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following form as a uniformly controlled [7(2 ArT )-subset, or, more succinctly, as a 
t/(2 7VT )-multiplexor: 

x= e wwo= n 

beBool N K b£Bool N K 

("multiplexor" means "multi-fold" in Latin. A special type of electronic device is 
commonly called a multiplexor or multiplexer). Note that X is a function of: a set 
« of control bits, a set r of target bits, and a f/(2 Arr )-subset {U^}^ eBool N K ■ FigJT] 
shows two possible diagrammatic representations of a multiplexor, one more explicit 
than the other. The diagrammatic representation with the "half moon" nodes was 
introduced in Ref. [3]. 



bit 
labels 




Figure 1: 2 Diagrammatic representations of a [7(2^ T )-multiplexor. 



For a given L r (2)-subset {Ub}\/b (and for any multiplexor with that £7(2)- 
subset), it is useful to define as follows what we shall call the optimal axis of the 
?7(2)-subset. Suppose that we express each Ub in the form 

U b = e i V az V (CTs i a!,+<Ts 2 /3b) (2a u ,) /(b) , (2) 

where r]b, a&, Pb, 76 are real parameters, where the vectors Si,§2, and w = §i x §2 
are orthonormal, and where f(b) is an indicator function which maps the set of all 
possible b into {0, 1}. Of course, e vr]b = y/det Ub- Appendix 1X1 shows how to find the 
parameters ab,Pb,1b for a given Ube~ ir,b G SU{2). Appendix IB1 solves the following 
minimization problem. If the value of the parameters a&, /3&, 75 and the vectors s\, §2, w 
are allowed to vary, while keeping the vectors §i,S2,w orthonormal and keeping all 
Ub fixed, find vectors §x,S2,w that are optimal, in the sense that they minimize a 
cost function. The cost function penalizes deviations of the diagonal matrices e lJbaz 
away from the 2d identity matrix 12- Any choice of orthonormal vectors §1, §2 will be 
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called strong directions and w = §\ x §2 will be called a weak direction, or an 
axis of the C/(2)-subset. An axis that minimizes the cost function will be called 
the optimum axis of the U (2) -subset, (an axis of goodness). 

It is also possible to define an optimum axis of a C/(2)-subset in the same way 
as just discussed, except replacing Eq.(J2J) by 



\f( b ) p i ( a 's 1 a b +cT S2 f3 b )i(T z 'y b 



(3) 

In Eq.fl2J), the diagonal matrix e n<Jz is on the left hand side, so we will call this 
the diagonal-on-left (DOL) parameterization. In Eq.(J3j), the diagonal matrix 
e *7o-* j s on ^he rig^ hand side, and we will call this the diagonal-on-right (DOR) 
parameterization. 



4 Tuc04 algorithm 
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Figure 2: Diagrammatic representation of the Cosine-Sine Decomposition (CSD). 



The Cosine Sine Decomposition (CSD) expresses an N dimensional unitary 
matrix U as a product LDR, where L = L ® L x , D = e ia y® & , R = R © R 1 , 
where L , Li, Rq, Ri are unitary matrices of dimension N/2, and B is a diagonal real 
matrix whose entries can be interpreted as angles between subspaces. Note that the 
matrices L, D and R are all multiplexors. Fig|2]depicts the CSD graphically, using the 
multiplexor symbol of Figd In Fig|21 a [/(2)-multiplexor whose £7(2)-subset consists 
solely of rotations around the Y axis, is indicated by putting the symbol R y in its 
target box. We will call this type of multiplexor an i? y (2)-multiplexor. 

Lets review the Tuc99 algorithm. It decomposes an arbitrary unitary matrix 
into a SEO by applying the CSD in a recursive manner. The beginning of the Tuc99 
algorithm for iV^ = 4 is illustrated in FigOl An initial unitary matrix Ui n is decom- 
posed via CSD into a product of 3 multiplexors L, D, R. The L and R multiplexors on 
each side of D are in turn decomposed via CSD. The L and R multiplexors generated 
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Figure 3: Recursive use of CSD to decompose a 2 4 dimensional unitary matrix. 



via any application of CSD are in turn decomposed via CSD. In FigEU we have stopped 
recursing once we reached multiplexors whose target box acts on a single qubit. Note 
that at this stage, Ui n is decomposed into a product of U (2)-multiplexors. There are 
1 + 2 + 4 + . . . + 2 Nb ~ 1 =2 Nb -1 of these U (2) -multiplexors (15 for Nb = 4). Half 
of these t/(2)-multiplexors have R y in their target boxes and the other half don't. 
Furthermore the R y type multiplexors and non-i?^ ones alternate. Furthermore, the 
non-it^ [7(2)-multiplexors have their target box at qubit 0, so, according to the con- 
ventions of Tuc99, they are direct sums of U (2) matrices. The Tuc99 algorithm deals 
with these direct sums of U(2) matrices by applying CSD to each U (2) matrix in the 
direct sum. This converts each direct sum of U(2) matrices into a product LDR, 
where R and L are diagonal unitary matrices and D is an i? y (2)-multiplexor. Thus, 
Tuc99 turns the last operator sequence shown in Fig|3] into a sequence of alternating 
diagonal unitary matrices and i? y (2)-multiplexors. Then Tuc99 gives a prescription 
for decomposing any diagonal unitary matrix into a SEO with 2 Nb CNOTs and any 
^^-multiplexor into a SEO with 2 Nb ~ 1 CNOTs. 
Tuc99 considers what it calls a D-matrix: 

D = eM ]T i* ? ®J*)= £ U t ®P tl (4a) 

beBoo^B- 1 beBoo^B- 1 
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with 



*7 ? =exp(iS ? ), (4b) 

where 

®b = h°v ■ ( 5 ) 

Here 0g is a real parameter. In the nomenclature of this paper, D is an R y {2)- 
multiplexor with a single target qubit at Nb — 1 and Nb — 1 control qubits at 
{0, 1, . . . , N B -2}. Tuc99 shows how to decompose D into a SEO with 2 Nb ~ 1 CNOTs. 
Tuc99 also discusses how, by permuting qubits via the qubit exchange operator, one 
can move the target qubit to any position {0, 1, ... , Nb — 1} to get what Tuc99 calls 
a direct sum of D matrices. In the nomenclature of this paper, a "direct sum of 
D matrices" is just an i?j / (2)-multiplexor with a single target qubit at any position 
out of {0, 1, ... , N B — 1}. In conclusion, Tuc99 gives a complete discussion of R y {2)- 
multiplexors and how to decompose them into a SEO with 2 Nb ~ 1 CNOTs. 

Next, let us consider how to generalize Tuc99. We begin by proving certain 
facts about [7(2)-multiplexors that are generalizations of similar facts obtained in 
Tuc99 for i? y (2)-multiplexors. 

Suppose §i, s 2 and w = §i x s 2 are orthonormal vectors. Suppose we generalize 
the D matrices of Tuc99 by using Eqs.(@]) with: 

$ 6 = ( t ) b,l (J s 1 + 06,2^2 • ( 6 ) 

Here 4>^ x and 0g y are real parameters. In Tuc99, we define to be a column vector 
whose components are the numbers 0g lined up in order of increasing b G Bool 1 ^ 13-1 . 
Here, we use the same rule to define vectors 6 X and <b v from 0r and 6? , respectively. 
In analogy with Tuc99, we then define 9 X and 9 y via a Hadamard Transform: 

0j = , 1 Hn b -\4>3 (7) 
for j G {x, y}. (Hn b -i has been normalized so its square equals one). Let 

Q b = 6 b,i a n + 6 b,2 a s2 ■ (8) 
As in Tuc99, D can be expressed as 

d= n A b, (9) 

bGBoo^B- 1 

where the operators {^4g} V 5 mutually commute, and can be expressed as 

A^expUe^NB-lifla^)] . (10) 
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Next we will use the following CNOT identities. For any two distinct bits a, /3, 

a w (a) n W © a Sl (a) = a Sl (a)a z ((3) , (11) 

and 

a w (a) n ^ © a S2 (a) = a S2 (a)a z ((3) . (12) 

These CNOT identities are easily proven by checking them separately for the two 
cases n{(3) = and n(j3) = 1. By virtue of these CNOT identities, Eq. (jlUj) can be 
re-written as 

A t = MiVB-ir^- 1 )...^^-!^^ • (13) 

As shown in Tuc99, if we multiply the matrices (given by Eq. (jl3j) ) in a Gray 
order in 6, many a w (N B - l) n <0 cancel. We end up expressing D SEO wherein 
one-qubit rotations (of bit Nb — 1) and ct w (Nb — l) n( ^ type operators alternate, and 
there is the same number (2 Nb ~ 1 ) of each. At this point, the <j w (Nb — l) n ^> operators 
may be converted to CNOTs using: 

a w (N B - l) n(/3) = e^**- 1 )-*- © a x (N B - l) n(/3) , (14) 

where e l<T ( N B-i)-dwx j s a one-qubit rotation that takes direction w to direction e x . 

Even for the generalized D discussed here (i.e., for the D with $g defined by 
Eq.©), it is still true that, by permuting qubits via the qubit exchange operator, one 
can move the target qubit to any position {0, 1, ... , iVg — 1}. 

As we have shown, our generalized D matrix can be decomposed into an 
alternating product of one-qubit rotations and CNOTs. The product contains 2 7Vs_1 
(one factor of 2 for each control qubit) CNOTs and the same number of one-qubit 
rotations. This product expression for D will contain a CNOT at the beginning and 
a one-qubit rotation at the end, or vice versa, whichever we choose. Suppose we 
choose to have a CNOT at the beginning of the product, and that this CNOT is 
a w (N B - l) n(M) , for some /j 6 {0,1,..., N B — 2}. Then the matrix D[ia w (N B - l)] nM 
can be expressed with one CNOT less than D, as a product which starts and ends 
with a one-qubit rotation. And D[ia w (NB — l)] n w is a U (2)-multiplexor just as much 
as D is. Indeed, 

[ia w {N B - l)] n <") = ia w (N B -l)n(fi) + n(fi) (15a) 
= ia w (N B - l)P (n) + PM , (15b) 

so 

D[ia w (N B -l)] n M = [^e^^P^ia^NB-l)}^ (16a) 

b 
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= (e i *tia w )®P 1 ;+ <?**®Pl (16b) 

b 

where S (fi) = {b E Bool 1 ^ 13 ^ 1 : = 0} and Si(/i) is the complement of 5 (//). Thus, 
the E/(2)-subset of D[icr w (iV B - l)] n(/l) is the same as that of D except that half of 
the e**& matrices are multiplied by ia w . 

In conclusion, we have pointed out a convenient type of £7(2)-multiplexor. 
The £7(2)-subset of a convenient U (2)-multiplexor consists of matrices of the form 
e l ^t(ia w )^ b \ where $g is given by Eq.© and / is an indicator function that maps 
the set of all b into Bool. A convenient [7(2)-multiplexor can be expressed as a SEO 
with 2 Nb ~ x - 1 CNOTs. 

Next we will give an algorithm that converts a C/(2)-multiplexor sequence such 
as the last operator sequence in Fig|3]into a sequence of convenient £7(2)-multiplexors. 
For defmiteness, we will describe the algorithm assuming Nb = 4. How to generalize 
the algorithm to arbitrary Nb will be obvious. 

1. As in Fig.(jHJ), let Ui n be the matrix to which CSD is initial applied. We assume 
that before we start applying CSD, Ui n has been normalized so that det([/j n ) = 
1. 

2. Apply CSD recursively, as show in Fig|31 Let Tj, where < j < 14, denote the 
15 U( 2) -multiplexors labelled thru 14 in FigOJ Thus, U in = T 14 . . . T 2 T!T . 

3. For now, let {Ub}\/b denote the f/(2)-subset of the multiplexor T . Find 
the optimum axis of {Ub}\/b when the £/& are expressed in the DOL form: 
U b = e^e^^i^+^M^y^. Note that T = A T c Q onv , where T c onv 
is a convenient [7(2)-multiplexor, and Ao is a diagonal unitary matrix that 
incorporates the diagonal matrix factor e irib e l(Tzlb of each b. Now define the "in- 
termediate" matrix T^ nter = TiA . Note that T^ nter is a ?7(2)-multiplexor. In 
general, the product of a U( 2) -multiplexor times a diagonal unitary matrix is 
again a U (2) -multiplexor. 

4. For j = 1,2,... 13, process T* nter in the same way that T was processed. In 
other words, find the optimum axis (for a DOL parametrization) of the £7(2)- 
subset of T* nter . Note that T* nter = AjT™ ni; , where Tj° nv is a convenient 
U (2)-multiplexor, and Aj is a diagonal unitary matrix. Now define the matrix 



After applying the previous steps, we will be able to write U,- in = X^ ter T^3 n1 ' . . . f ^"Tq ™. 
In this expansion of Ui n , all except the last multiplexor are of the convenient type. 
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5. One possibility at this point is to process T l ^ er and then stop. That is, express 
Y* 1 4 ter as a product of a diagonal unitary matrix A 14 and a convenient multi- 
plexor Tf™ . Then express each of the 15 convenient multiplexors T c ° nv for 
j = 0, 1, ... 14 as a SEO with 2 Nb ~ 1 - 1 CNOTs. Finally, expand the diagonal 
unitary matrix A 14 as a SEO with 2 Nb CNOTs, using the technique given in 
Tuc99 for doing this. 

5'. A second possibility is to repeat the previous steps in the reverse direction, this 
time going from left to right, and using DOR parameterizations. Continue to 
sweep back and forth across the sequence of multiplexors. We conjecture that 
after a few sweeps, we will start producing diagonal matrices Aj that are closer 
and closer to unity. When the latest Aj matrix is acceptably close to unity, the 
process can be stopped. At this point, the axes of the multiplexors will have 
reached a kind of equilibrium, and we will have expressed Ui n as a product of 
convenient [7(2)-multiplexors. 

Sweeping only once (ditto, many times) is what we called the Tuc04(NR) 
algorithm (ditto, the Tuc04(R) algorithm) in the Introduction section of this paper. 

For Tuc04(R), Ui n is expressed as product of 2 Nb — 1 convenient ?7(2)-multiplexors, 
each of which is expressed as 2 Nb ^ - 1 CNOTs, so e = {2 Nb - 1){2 Nb - 1 - 1). 

For Tuc04(NR), finding the optimum axis of each [7(2)-multiplexor is un- 
necessary. Doing so changes the final diagonal matrix A 14 , but does not cause it 
to vanish. The lady does not vanish. Thus, for Tuc04(NR), it is best to simply use 
(si, §2, w) = (e x , e y , e z ) throughout. The Tuc04(NR) algorithm is essentially the same 
as the HUT04b algorithm. Tuc04(NR), compared with Tuc04(R), has the penalty of 
having to expand the final diagonal matrix A 14 . This produces an extra 2 Nb CNOTs. 
So for Tuc04(NR), e = (2 Nb - 1)(2 Nb ~ 1 -1)+2 Nb . 

Note that for Tuc04(R), it is not necessary to find very precisely the optimum 
axis of each t/(2)-multiplexor. Any errors in finding such an axis do not increase the 
numerical errors of compiling Ui n . It may even be true that the axes equilibrate as 
long as one provides, each time step E3 above calls for an axis of a U(2)-multiplexor, 
an axis that has a better than random chance of decreasing the cost function defined 
in Appendix IB1 

A Appendix: Parameterizations of 
SU(2) matrices 

In this appendix, we will show how, given orthonormal vectors sj, §2 and w = §i x s 2 , 
and given any SU(2) matrix U, one can find real parameters a,/3, 7 such that U = 
e i-ya Ze i(a(T S1 +f3a S2 ) _ ^y e w jjj uge we \\ known identity 

e is4 = cos9 + i(a ■ 9) sin9 , (17) 
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where a = (a x , a y ,cr z ), 9 is a real 3d vector of magnitude 9, and 9 

Note that given a matrix U G 577(2), if we express its transpose £7 T in the 
form U T = e l ' yaz e' l ( aff ' Sl+ l 3a " S2 \ then this gives an expression for U of the form U = 

e i(a?.«' 1+ ^a-S' 2 ) e i 7 a z where for ■ £ 3^ ^ = ^ ^ = - Sjy} anc l S ^ = Sj2 . (This 

follows from the fact that a x T = a x , a y T = —cr y , a z = & z -) Likewise, given a matrix 
U E SU(2), if we express U(-ia w ) in the form U(-ia w ) = e^'e^'i+^-a), then this 
gives an expression for U of the form U = e l/yaz e^ ar7si+,3a! " 2 \ia w ). 

In the general case, the triad (si,§2,e z ) is an oblique (not orthogonal) basis 
of real 3d space. As warm up practice, consider first the simpler case when the triad 
is orthogonal; that is, when §i = e x , s 2 = e y . Any U G SU{2) can be expressed as 

' , where x, y are complex numbers such that Ixl 2 + \y\ 2 = 1. Thus, we 

//' x 

want to express a, /3, 7 in terms of x, y, where: 



x y 

-y x 



Let 9 = y/a' 2 + f3' 2 . Using Eq. (fT7|) . it is easy to show that 

x = e* 7 cos 9 , 



and 



y = e 



'7 



(P + ia) . 
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sin 9 . 



19a) 



(19b) 



If we assume that cos 9 > 0, then Eqs.()19|l can be easily inverted. One finds 

7 = L{x) , (20a) 



and 



cos 9 = \x\ 



(3 + ia = ^-9 . 
\xy\ 



(20b) 



(20c) 



Next, we consider the general case when the triad (§1, s 2 , e z ) is oblique. One 



has 



x y 
-y* x* 



g*7°"z gi(o!0- sl +/3cr S2 ) 



(21) 



Define 9 by 



9 = asi+ I3s 2 = 9 x e x + 9 y e y + 9 z e z 



(22) 
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Thus, 

9 = ya 2 + (3 2 
Using Eq.(fT7|). it is easy to show that 



% + % + 



x = e i7 (cos6 l + i-^sim 
9 



and 



y 



sin 9 . 



(23) 



(24a) 



(24b) 



We want to express a,/3, 7 in terms of Unlike when the triad was orthogonal, 
now expressing 7 in terms of x, y is non-trivial; as we shall see below, it requires 
solving numerically for the root a non-linear equation. The good news is that if we 
know 7, then a and (3 follow in a straightforward manner from: 



cos# = Re(xe n ) , 



(25) 



and 



y + iO x = (ye 

Given 9 x ,9 y , one can find a, (3 using Eq.(|59|). 



sin 9 



(26) 



Since \x\ + \y\ — 1, Eqs. (j24j) are equivalent to the following 3 equations: 

2 



x 



cos 2 9 ■ [ - I surf/ 



9 



L(x) = 7 + arctan 



'# 2 sin 9 
9 cos 9 



and 



^(2/) = 7 + arctan 



0,, 



As stated previously, 



9 = a§i + 0§2 



(27a) 
(27b) 

(27c) 
(27d) 



Next, we will solve the 6 equations given by Eqs. (|2*7jl for the 6 unknowns (a, (3, 7, 9 X , 9 y , 9 Z ). 
From Eq. ()27djl . it follows that 



Six 


S2x 




a 




9 X 


_ s iy 


S2y _ 




J . 
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Thus, 



a 


1 


J. 


~ A 



s 2y ~ s 2x 

S\y S\ x 



9 X 

On 



(29) 



The determinant A is given by 

A = s lx s 2y - s ly s 2x = si x s 2 ■ e z = w z . (30) 

Substituting the expressions for a, /3 given by Eq.()29|) into the Z component of 
Eq. (j27d|) now yields 



Z = asi z + (3s 2z 

(S 2y x — S 2x 6 y 



/ —Sly6 X + S\ X 9y \ 



where 



k 

W z 



(31a) 
(31b) 
(31c) 

(32) 



for ji G {x, y}. 

At this point, we have reduced our problem to the following 4 equations for 
the 4 unknowns 7, 8 X , 6 y , 6 Z : 



|x| 2 = cos 2 # + ( e -A sin 2 e 



tan(Z(x) — 7) 



8 Z sin 9 
6 cos 6 



and 



tan(Z(y) -7) = 

Uy 



Z k X X hyOy . 



Define the following two shorthand symbols 



Eqs.(j33cD and ()33dj) yield 



tan(Z(x) — 7) 


ty — 


tan(Z(y) 


-7) 


1 ~ty' 






" " 










. e * . 





(33a) 
(33b) 
(33c) 

(33d) 
(34) 

(35) 
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Thus, 



®x 




ty 




ky ~t~ k x ty 


1 



(36) 



Substituting the values for 6 X and 6 y given by Eq. (}3l)j) into the definition of 6 yields: 



9, 



x^y 



J(ky+k X ty) 2 +t 2 y + l 



(37) 



Eqs. (|SSaD and (j33b|) yield 















cos 2 




x 2 


.« -(*) . 




sin 2 








(38) 



Thus, 



cos 2 
sin 2 



(*)■ 



tl) 



t 2 



(39) 



Consider the two components of the vector on the right hand side of the last equation. 
They must sum to one: 



Substituting the value for % given by Eq.(|37|) into Eq. (j4T?|) finally yields 



(40) 



(k y + k x t y ) 2 {\ + t 2 x ) \y\ 2 = (1 + t 2 y )tl\x\ 2 . (41) 

As foretold, in order to find 7 in terms of (x,y), we must solve for the root 7 of a 
nonlinear equation, Eq. (}4~Tj) . 



B Appendix: Optimum Axis 
of C/(2)-subset 

Let {Ub}\/b be a £/(2)-subset. Suppose that we express each JJ h in the form 

U b = e ^ e *™j(°. 1 o*+°. 3 fh,)( i(Tv) y(b) , (42) 

where r]b,ab, Pb,1b are real parameters, where the vectors §i,§2, and w = «i x s 2 
are orthonormal, and where f(b) is an indicator function which maps the set of all 
possible b into {0, 1}. Of course, e ir)b = y/det Ub- Appendix 1X1 shows how to find the 
parameters a&, 7^ for a given Ube~ tVb G Si7 (2). The goal of this appendix is to solve 
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the following minimization problem. If the value of the parameters a b , (3b, lb and the 
vectors §i,s 2 ,w are allowed to vary, while keeping the vectors §x,s 2 ,w orthonormal 
and keeping all Ub fixed, find vectors §x,s 2 ,w that are optimal, in the sense that 
they minimize a cost function. The cost function penalizes deviations of the diagonal 
matrices e llbCrz away from the 2d identity matrix I 2 . Any choice of orthonormal 
vectors si, s 2 will be called strong directions and w = s\ x s 2 will be called a weak 
direction, or an axis of the C/(2)-subset. An axis that minimizes the cost function 
will be called the optimum axis of the £/(2)-subset. 
In Appendix |XJ we used a quantity 6 such that 

Hence, 

Q h = a b h + (3 b s 2 . (44) 

In this appendix, we will find it convenient to use additional symbols rb,p q ,qb, X^i, 
and Xb t 2 which satisfy 

e i(a n a b +a S2 ft) = ^ + - . - ^ (45) 

f b = (hXb,! + s 2 x b , 2 )qb , (46) 

pl + qt = l, (47) 

and 

Xl, + X* 2 = 1 . (48) 

Eq.(HU) expresses 9b in terms of the "fundamental" variables (a&, /3&, 7&, si, s 2 ). Like- 
wise, r b ,p g , qb, X b) \, and X b)2 can be expressed in terms of these fundamental variables 
as follows: 

(a b h + (3 b s 2 ) . 



n= y , ^ sm^al + ft, (49) 



p b = cosJa% + (31 , (50) 



q b = sin^/ag + (3% , (51) 



X M = -f^= , (52) 
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For each b, define a correction Cb by 



C b = e ia ^ b . (54) 

We will use the simple matrix norm \\A\\ = tr(AA^) (i.e., the sum of the absolute value 
of each A entry). We define the cost function (Lagrangian) £ for our minimization 
problem to be the sum over b of the distance between Cb and the 2d identity matrix 
I 2 . Thus, 

£ = EH^-^II (55a) 

b 

= 5>[2-(C6 + <5)] (55b) 

b 

= 4^(1 -cos 7 b ) . (55c) 

b 

The cost function variation is 

5C = 4 5>n( 7b )<5 7 6 • (56) 

b 

The variations d^b represent N b degrees of freedom (dof's), but they are not inde- 
pendent dofs, as they are subject to the following constraints. For all b, Ub is kept 
fixed during the variation of C, so 



SU b = (ia z 5 lb )Ub + e iVb e ia ^ b (dp b + id ■ 5f b )(ia w ) m + U b (if(b)a ■ 8w) = . (57a) 

(We've used the fact that f(b) G {0, 1}). The vectors si and s 2 are kept orthonormal 
(i.e., Sj ■ Sk = 5(j, k) for all j, k e {1, 2}) during the variation of £, so 

6(8, ■ h) = (57b) 

for j, k G {1,2}. Finally, the points ipb,Qb) an d (X bl ,X b2 ) are constrained to lie on 
the unit circle, so 

PbSpb + qbSqb = , (57c) 

and 

X bl 5X bl + X b2 5Xb2 = . (57d) 

Eq. (|57a|) represents 3N b constraints. Eq. (j57bjl represents 3 constraints. Eq. ()57cj) 
and Eq. (j57d|) together represent 2N b constraints. Thus, Eqs. ljBTjl altogether represent 
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5N b + 3 (scalar) equations in terms the 5N b + 6 (scalar) unknowns (the unknowns 
are: 3 components of 8§i, 3 components of Ss 2 , and, for all b, 5%, 5p b , Sq b , SX bl , 5X b2 ). 
Therefore, there are really only 3 independent dofs within these 5N b + 6 variations. 
Next, we will express 5C in terms of only 3 independent variations (for independent 
variations, we will find it convenient to use w ■ 5§i,w ■ 5s 2 and §i ■ Ss 2 ). Once 5C 
is expressed in this manner, we will be able to set to zero the coefficients of the 3 
independent variations. 

Eq. (j57a|) implies the following 4 equations: (we use the fact that pi + ff = 1) 

- r bz S% - n - f{b)f b ■ 5w = , (58a) 
Pb 

and 

h b 5-f b + 5r b + e b = } (58b) 

where 

h = [r by , -r bx ,p b ] T , (59) 

and 



e b = f{b)(p b 5w - r b x 5w) . (60) 

Eqs. (13H|) constitute 4 constraints, but only 3 are independent. Indeed, if one dot- 
multiplies Eq. (j58b|) by r b , one gets Eq. (j58aj) . So let us treat Eq. (j58a|) as a redundant 
statement and ignore it. Dot-multiplying Eq. (j58b|) by si, s 2 and e z separately, yields 
the following 3 constraints: 

— * 

(s fc • h b )5^ b + s k ■ 5f b + s k ■ e b = , (61a) 

for j G {1, 2}, and 

Pbhb + Sr bz + e bz = . (61b) 

Now we proceed to express 5f b in terms of 5si and 5s2- From the definition 
r b = Y^j=i SjX b jq b , we immediately obtain 

2 2 

5f b = ^(6sj)X bj q b + SjS(X bj q b ) . (62) 



Hence 



h ■ 5r b = J2(h ■ 5s j )X bj q b + J2§ k - Sj5(X bj q b ) 
j j 
= ^2(sk- Ssj)X bj q b + S(X bk q b ) , (63a) 
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and 

8 n z = ^2($Sjz)X bj q b + s jz 5(X bj q b ) . (63b) 

If we substitute the expressions for s k • 5r b and 5r bz given by Eqs.(|6^|) into 
Eqs.flfiT]). we get 

{h ■ h)5j b + J2(s k ■ 6sj)X bj q b + 5(X bk q b ) + s k ■ e b = , (64a) 

j 

and 

Pbhb + J2(5s jz )X bj q b + s jz 5(X bj q b ) + e bz = . (64b) 

3 j 

Substituting the expression for 5(X b jq b ) given by Eq. (j64aj) into Eq. (|64b|) yields 

A b 8 lb = B b , (65) 



where 



A b = Pb - s iz h b ' % > ( 66 ) 



and 



Thus, 



B b = - J2(5s jz )X bj q b + ^2 s jz (sj ■ Ss k )X bk q b + ^ s jz §j ■ e b . (67) 

j j,k j 

5C = 4^sin( 7fe )5 7b = 4]Tsin( 7b )^ . (68) 

6 b A b 

We have succeeded in expressing SC in term of the 9 variations S§i, 6s 2 , 6w of 
the strong and weak directions. But not all of these 9 variations are independent due 
to the orthonormality of Si,§2,w. Our next goal is to express these 9 variations in 
terms of 3 that can be taken to be independent. 

For j 6 {1, 2}, Sj ■ w = so 

§j ■ 5w = —w ■ 5§j . (69) 

Note that 

§i ■ s 2 x 5w = (§i x s 2 ) • Sw = w ■ 5w = . (70) 

Thus, 

§j ■ s k x 5w = (71) 

for any j, k G {1, 2}. Hence, 
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It follows that 



§j ■ f b x 5w = 



(72) 



Sj ■ e b = -f(b)p b w ■ 5Sj . (73) 

Define A, Ai, A 2 by 

A = h ■ 5s 2 (74) 

and 

Xj — w ■ 5§j (75) 

for j G {1, 2}. One can always expand 5§i and Ss 2 in the orthonormal basis (si, s 2 , w). 
The constraints 5(s 3 - • §k) = for j, G {1, 2}, force such expansions to be: 

6§i = —X§2 + XiW , (76a) 

and 

5s 2 = X§i + X 2 w . (76b) 
Using Eqs.([73|) and (J75J) . as given by Eq. fJBTj) can be re- written as 

B b = -q b w z X bj Xj - f(b)p b ^ SjzXj . (77) 

3 3 

Substituting this expression for B b into Eq.(|HS|l for 5C gives a new expression for SC. 
In the new expression for 5C, we may set the coefficients of Ai,A2,A separately to 
zero. This yields: 

= J2 sin ^b[qbW z X bj + f(b)pbS jz ] ^ 
b Pb - Ej Sjzh ■ Sj 

for je {1,2}. 

Next, we want to solve the 2 equations Eqs.(|78|) for the direction w. As in 
Appendix 1X1 let = w^/w z for \i G {x, y}. Then 

w = p^M^= . (79a) 

Y 1 + + 

We can always assume that = 0. If we do so, then 

h = , (79b) 

and 
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S 2 = W X Si 



(79c) 



Suppose we denote the two constraints of Eq. (J78j) by F\ = 0, F 2 = 0. These two 
constraints depend on the set of variables V = §2} U {a&, /3&, 7&}vf>- Using Eqs.(f79*j) 
and the results of Appendix |X[ the variables V can all be expressed in terms of k x , k y 
and {£4}vb- Thus what we really have is Fj(k x , k y , {Ub}vb) = for j e {1, 2}. These 
two equations can be solved numerically for the two unknowns k x ,k y . 
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